Vsite training with regularization #89

Open

fjclark wants to merge 10 commits into main from vsite-training-with-regularization

Conversation

@fjclark (Contributor) commented Mar 16, 2026

Description

This updates #73 to be compatible with recent updates to how regularisation is done, and also addresses the review comments I left there.

@JMorado you'll probably be the first to use this code, so would you be happy to briefly review? Thanks

Status

  • Ready to go

@codecov-commenter commented Mar 16, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.52%. Comparing base (40d4051) to head (0bfb99f).

Additional details and impacted files
@@            Coverage Diff             @@
##             main      #89      +/-   ##
==========================================
+ Coverage   99.49%   99.52%   +0.03%     
==========================================
  Files          11       11              
  Lines         981     1044      +63     
==========================================
+ Hits          976     1039      +63     
  Misses          5        5              
Flag       Coverage Δ
unittests  99.52% <100.00%> (+0.03%) ⬆️


@JMorado left a comment

I'm still getting familiar with the codebase, but overall LGTM. Left a few minor comments. Thanks!

# make sure we have vsites in the force field
assert ff.v_sites is not None
# this is awkward to specify in the yaml config file can we make it easier?
expected_ids = ["[#1:2]-[#8X2H2+0:1]-[#1:3] EP once"]

This ID looks weird indeed. Can't it be specified more simply as "[#1:2]-[#8X2H2+0:1]-[#1:3] EP"?

@fjclark (Contributor, Author)

"once" is added here https://github.com/openforcefield/openff-interchange/blob/c2f82bb4d4beceef80deda257155bec5c5b038a0/openff/interchange/smirnoff/_virtual_sites.py#L122 (as the pattern matches once), so the ID ends up as displayed. I can't think of a way to get round this without making the expected ID less precise, but let me know if you have any thoughts!
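For anyone else surprised by the trailing "once": the ID is assembled roughly as SMIRKS pattern + site name + match keyword. A minimal sketch (the f-string is illustrative only, not the actual interchange code):

```python
# Illustrative only: interchange appends the match keyword ("once") after the
# virtual-site name when the SMIRKS pattern matches a molecule exactly once,
# which is why the expected ID in the test carries the extra suffix.
smirks = "[#1:2]-[#8X2H2+0:1]-[#1:3]"
name, match = "EP", "once"
vsite_id = f"{smirks} {name} {match}"
# -> "[#1:2]-[#8X2H2+0:1]-[#1:3] EP once"
```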

Comment on lines +439 to +445
values = trainable.to_values().detach()
# set the distance to outside the clamp region
values[-1] = 0.0
ff = trainable.to_force_field(values)
assert torch.allclose(
    ff.v_sites.parameters[0],
    torch.tensor([-0.0100, 3.1416, 0.0000], dtype=torch.float64),
)

I'm confused by this bit -- you set the distance of the last value to outside the clamp region, but then it's the first value that gets clamped to -0.01?

@fjclark (Contributor, Author)

I've added a comment to clarify.
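For reference, the clamping itself is just an elementwise min/max; a pure-Python sketch (the bounds here are hypothetical, chosen only to illustrate the behaviour):

```python
import math

def clamp(value: float, lower: float = -math.inf, upper: float = math.inf) -> float:
    """Clamp a scalar into [lower, upper]; infinite bounds are no-ops."""
    return max(lower, min(upper, value))

print(clamp(0.0, lower=-0.01))   # in range: unchanged -> 0.0
print(clamp(-0.5, lower=-0.01))  # below the lower bound -> clamped to -0.01
```

A value outside the clamp region comes back exactly at the nearest boundary, which is why the test above asserts a boundary value rather than the raw one.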

lower, upper = config.limits.get(col, (None, None))
clamp_lower.append(-torch.inf if lower is None else lower)
clamp_upper.append(torch.inf if upper is None else upper)
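The defaulting quoted here can be exercised in isolation; a minimal pure-Python sketch (resolve_limits and the column names are hypothetical, standing in for the config handling above, with math.inf in place of torch.inf):

```python
import math

def resolve_limits(limits: dict, cols: list[str]) -> tuple[list[float], list[float]]:
    """Per-column clamp bounds; a missing limit defaults to +/-inf (no clamping)."""
    clamp_lower, clamp_upper = [], []
    for col in cols:
        lower, upper = limits.get(col, (None, None))
        clamp_lower.append(-math.inf if lower is None else lower)
        clamp_upper.append(math.inf if upper is None else upper)
    return clamp_lower, clamp_upper

lower, upper = resolve_limits({"distance": (0.0, 3.0)}, ["distance", "in_plane_angle"])
# lower == [0.0, -inf], upper == [3.0, inf]
```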


Do we need scales and clamp limits for frozen values?

@fjclark (Contributor, Author)

No, I was just going off the previous implementation, but I agree it would be better without them. I've refactored to avoid keeping track of frozen values.

descent/train.py Outdated
Comment on lines +496 to +499
-        self._scales = torch.cat(scales)[self._unfrozen_idxs]
+        self._scales = torch.cat([param_scales, attr_scales])[self._unfrozen_idxs]
         self._clamp_lower = torch.cat(clamp_lower)[self._unfrozen_idxs]
         self._clamp_upper = torch.cat(clamp_upper)[self._unfrozen_idxs]

If _prepare_values returned scales and clamp limits only for unfrozen values, there wouldn't be a need to slice here. I'm also not a fan of offsetting with len(param_scales) + len(attr_scales), as it seems a bit fragile.

@fjclark (Contributor, Author)

Thanks, yes, I agree this feels brittle. I've refactored to avoid it -- I now do:

        offset = 0
        for block in blocks:
            all_values.append(block.values)
            all_unfrozen_idxs.append(block.unfrozen_idxs + offset)
            all_scales.append(block.scales)
            all_clamp_lower.append(block.clamp_lower)
            all_clamp_upper.append(block.clamp_upper)
            all_regularized_idxs.append(block.regularized_idxs + offset)
            all_regularization_weights.append(block.regularization_weights)
            offset += len(block.values)
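In case it helps future readers, the offset bookkeeping can be checked in isolation; a pure-Python sketch with a hypothetical Block container holding just values and unfrozen indices:

```python
# Sketch of the per-block offset bookkeeping above: each block's unfrozen
# indices are shifted by the number of values seen so far, so they index
# correctly into the concatenated value vector. Block and concatenate are
# hypothetical stand-ins for the real trainable-parameter structures.
from dataclasses import dataclass

@dataclass
class Block:
    values: list[float]
    unfrozen_idxs: list[int]

def concatenate(blocks: list[Block]) -> tuple[list[float], list[int]]:
    all_values: list[float] = []
    all_unfrozen_idxs: list[int] = []
    offset = 0
    for block in blocks:
        all_values.extend(block.values)
        all_unfrozen_idxs.extend(i + offset for i in block.unfrozen_idxs)
        offset += len(block.values)
    return all_values, all_unfrozen_idxs

values, idxs = concatenate([Block([1.0, 2.0], [1]), Block([3.0, 4.0, 5.0], [0, 2])])
# idxs == [1, 2, 4]: index 0 of the second block lands at global index 2
```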

@j-wags (Member) commented Mar 23, 2026

This just came up at our iteration planning. Since OpenFF is the current code owner here (and to keep merging from being ambiguous), I'm assigning Lily so she can help coordinate merging this after the review, but I'll assume this is waiting on Joao's review before that.

@lilyminium mentioned this pull request Mar 24, 2026
@fjclark (Contributor, Author) commented Mar 25, 2026

Thanks for the review @JMorado -- I think I've addressed all of your comments, and I've added a fairly large refactor of train.py which avoids keeping track of frozen values and tries to make the offsetting less brittle.

6 participants